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(57) Abstract 

The present invention provides an improved method and apparatus for translating a source language to a target language. The 
invention uses placeables (e.g., proper nouns, titles and names, dates, times, units and measurements, numbers, formatting information, such 
as tags or escape sequences, styles, graphics, hyperlinks) to assist a translator by not having to retype information that does not need to be 
translated and to provide conversions to the target locale if necessary like for speeds. 
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WO 99/57651 PCT/EP99/02959 

MACHINE-ASSISTED TRANSLATION TOOLS 
BACKGROUND OF THE INVENTION 



1. Field of the Invention 

The present invention relates to machine processing of text and 
language and, more particularly, to a method and apparatus including a 
software implementation for machine-assisted translation or machine 
translation. 

2. Discussion of the Related Technology 

Translation of text from one language to another is often a tedious task 
requiring the efforts of a skilled translator. Soon after the advent of 
computers, researchers began to use computers as an aid for natural language 
translation. The earliest machine translation (MT) systems relied on large 
bilingual dictionaries where entries for words of the source language (SL) 
gave one or more equivalents in the target language (TL). It quickly became 
apparent that dictionary rules for syntax and grammar were so complex that 
experts could not develop a comprehensive set of rules to describe the 
umzi2333& t rans lation have been abandoned. 

*! Ss 3 BBg »aBBi the world, multilingual cultures and multinational trade 
create an increasing demand for translation services. The demand for 
translation of commercial and technical documents represents a large and 
growing segment of the translation market. Examples of such documents are 
contracts, instruction manuals, forms, and computer software. Often when * 
a product or service is "localized" for a new market, a great deal of 
documentation must be translated, creating a need for cost-effective 
translation. Because commercial and technical information is often detailed 
and precise, accurate translations continue to be in demand. 

Machine translation (MT) systems are usually classified as either 
direct, transfer-based, or interlingua-based. In the direct approach, there are 
no intermediate representations between the source language and the target 
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language. The source language text is processed "directly" in order to 
transform it into the target text.. This process is essentially a word-to-word 
translation with some adjustments. This approach is not followed by any MT 
system at present due to a perceived weakness attributable to ignoring all 
aspects of the internal structure of sentences. 

In the transfer-based approach, information from the various stages of 
analysis from the source text is transferred to the corresponding stages of the 
generation of the target text. For example, transfer is achieved by setting up 
correspondence at the lexical level, at the grammatical level, or at the level 
of the structure built by the grammar, and so forth. The transfer method 
operates only on a particular pair of languages and, therefore, must be 
specifically and painstakingly created for each pair of languages. 

The interlingua-based approach depends upon an assumption that a 
suitable intermediate representation can be defined such that the source text 
can be mapped into the intermediate representation which can then be 
mapped into the target text. In principle, this approach is clearly attractive 
because, unlike the transfer-based approach, it is not necessary to build a 
separate transfer program for each pair of languages. However, it is not clear 
whether a truly language-independent intermediate representation can be 
devised. Current interlingua-based systems are much less ambitious about 
tfegg cfernn^ to the universality ox the intermediate representation- For a 
MgSr-cssafirr translation, it is c&a necessary to have a rrws to some 
psrticalar aspects of the so*H*ce asssi s&Eget languages. 

In the transfer-based approach, there have been some recent advances. 
In the development of mathematical and computational models of grammar, 
there is increasing emphasis on locating syntactic as well as semantic 
information directly with the lexical items by associating structures with the 
lexical items and defining operations for composing these objects. From this 
perspective, all the information particular to a language is encapsulated in the 
lexical items and the structures associated with them. Different languages 
will be distinguished at this level, but not with respect to the operations for 
composing these structures, which are the same for all languages. The idea, 
then, is to define all bilingual correspondence at this level. It remains to be 
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seen whether this approach can be carried out among a variety of different 
languages. 

Some existing MT systems require that documents be written in highly 
constrained texts. Such systems are useful for preparing manuals in 
different languages. Here, the system is really not translating a manual 
written in one natural language into a set of other natural languages, but 
rather is generating multilingual texts from a highly constrained text, thus 
avoiding many problems in conventional MT. 

Recently, research has focused on ways of using machines to assist 
human translators rather than to autonomously perform translations. This 
approach is referred to as machine-assisted human translation (MAHT).^ 
Systems are available that produce high-quality translation of business 
correspondence using pre-translated fragments with some translations filled 
in by human translators. An example of a machine-assisted translation tool 
is a translation memory (TM) system. Translation memory systems leave the 
creative work to the translator, however they can learn from the translator, 
and they actively support the translation process by automatically suggesting 
existing translations and terminology. A translation memory is a database 
th*± collects translations as they are performed, along with the source 
l^igrwEgag ^j|g|g afegfcs After a. number of translations have been performed 
.se in Tfrg* translation memory, tie translation memory can be 

-ff^^s^ to new translations where the new translations include 

identical or CTrnHflr source language text as had been included in the 
translation memory. 

The advantage of such a system is that it can, in theory, leverage 
existing MT technology to make the translator more efficient without 
sacrificing the traditional accuracy provided by a human translator. The 
System makes translations more efficient by ensuring that the translator 
never has to translate the same source text twice. While a translator works, 
translation memory operates in the background to 'learn 5 original sentences 
and their corresponding translations. In the process, this data may be linked 
into the neural network. Later, translation memory rapidly finds identical 
or similar sentences and automatically displays them as a working basis for 
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creating a new translation. Thus, translation memory ensures that no 
sentence need be translated twice. 

Translation memories are most useful when they are able to locate not 
only, identical matches, but also approximate or "fuzzy matches." Fuzzy 
matching facilitates retrieval of text that differs slightly in word order, 
morphology, case, or spelling. The approximate matching is necessary 
because of the large variety possible in natural language texts. Fuzzy 
matching to find sentences with similar content has seen its performance 
perfected by the implementation of neural network technology. The 
translator has the option of choosing among alternative translations in 
addition to the one automatically suggested by memory. Along with the 
source sentence and its translation, each translation unit can also- store 
information on users, dates and frequency of use, and classifying attributes 
and text fields. This information enables easy maintenance of translation 
memories, which naturally become quite large over time. 

Concordances are another tool commonly used by translators. 
Electronic concordances are files having text strings, i.e., words, phrases or 
sentences, that are matched with the context in which the word appeared in 
a psrticafer doctmigsxt When a translator is unsure of the meaning to be 
jsrssesx a. p gg ^ ac^g wurd , fee concordance can demonstrate how the word is 
TZFxP&in j^^roF *¥iTw* r% ±n ± co ntests This information allows for a more proper 
/saectk^ of trHHsiaSoBS to accurately reflect the meaning of a source 
fa agsrgg e document Electronic concordances include text searching software 
thsa: aBows the translator to extract all text strings in a library that include 
a desired word or phrase. The extracted texts strings can be examined 
quickly to gain a greater understanding of how a particular word or phrase . 
is used in context. 

Multilingual natural language processing represents a growing need 
and opportunity in the field of international commerce and communication. 
Machine-assisted translation tools are needed to make document translation 
more efficient and less costly. Furthermore, machine-assisted translation 
tools are needed that efficiently leverage the large amount of stored 
knowledge available as pre-translated commercial and technical documents. 

-4- 
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Specifically, a need exists for a translation memory tool that is language- 
independent and provides accurate, rapid fuzzy retrieval of pre-translated 
material. 

Up until now, text that was considered to be a placeable had to be 
translated and manually entered by the translator. Placeables are often re- 
used "as is" in the translated text or in a converted form. Examples of such, 
placeables are: proper nouns, titles and names, dates, times, units and 
measurements, numbers, formatting information, such as tags or escape 
sequences, styles, graphics, hyperlinks, cross-references, automatic fields in 
text, or any other kind of information that will not be translated but, rather, 
converted without knowledge about the context. The translation of placeables 
is time-consuming and can lead to errors when conversions must be made for 
things such as currency, e.g., dollar to yen and speed, e.g., miles per hour may 
to kilometers per hour. There is a need for a. program that identifies the text 
considered to be placeable, makes any necessary conversions, and inserts the 
placeable into the target text. 

SQMM&ISr OF THE INVENTION 
Tl^e 55a r ^g55KEgg £gs i 3&3n prcvkiss an improved method and apparatus for 
fe^ggS a sfeg z spsa E ce language into a target language. The invention uses 
pIsKsahies to ^^ i : a translator by facilitating the automatic or semiautomatic 
replacement of placeables in the target language and making any necessary 
conversions according to the target locale, e.g., "German - Standard". A 
placeable as used herein is a term that designates data that does not require 
translation into a target language or, in some cases, data types that are 
particularly suitable for semiautomatic replacement (e.g., proper nouns, titles * 
and names, formatting information, such as tags or escape sequences, styles, 
graphics) and data requiring a translation that does not change the context 
of the data (e.g., physical and currency units, time zones, date formats, 
hyperlinks etc.). In addition, a placeable may be more complex and advanced. 
For example, a placeable could be determined by specialized dictionaries 
and/or the context or environment information of the entire information 
designated for translation, e.g., data in the chemical environment, automotive 
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environment, music lyrics, legal environment. The context or environment 
information would then decide how certain terms are translated. A 
source placeable identifier may be used to identify the placeables in the source 
information, e.g., source locale, based on source concordance relating to the 
context and environment of the placeable. In translation memories, the 
placeable may be converted into a language-independent format, e.g., meta- 
representation. The language-independent format allows the translation 
memory to convert the placeable into any target language because the format 
is common to all locales. After conversion to the independent format, the 
placeable can be automatically or semiautomatically placed in the target 
translation. A target placeable converter is used to convert the placeables 
into target information , e.g., target locale, based on target concordance 
relating to the context arid environment of the placeable. 

A system, according to the invention, may identify a placeable and 
determine its type in order to facilitate subsequent handling of the placeable, 
typically to facilitate a decision on placing, converting, or translating the 
placeafcfe- The identification of a placeable and deter mina tion of its type may 
be sscosspKsissed ir? a rule-based process. In addition, the identification and 
determsnaiiGix process may be performed by or with the assistance of a finite 
state - pr ?? ? cfalne such as table lookup functions or a character by character 
determination. 

In database-driven TMs using this invention, there is a high potential 
to reduce the amount of storage space needed to store the source and target 
units in pairs by storing the units as templates, or skeletons, together with 
the placeable information. 

An object of the invention is to reduce the effort required of a 
translator to translate source information into target information by 
eliminating the need to manually type or move the placeable to a translation 
by allowing placeables to be inserted in the target information and to perform 
any desired conversions by means of a target placeable converter. 

Another object of the invention, is to reduce the amount of time or 
effort required to translate source text by automatically converting placeables, 
e.g., dates, measurement units into target text for insertion. 

-6- 
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Another object of the invention is to reduce errors that may occur 
when a translator is manually converting measurement units in a source text 
to a target text by automatically converting placeable data. 

Another object of the invention is to reduce the amount, of time or 
effort required to translate source text by automatically identifying formatting 
codes and inserting the codes into a target text. 

Another object of the invention is to reduce the amount of time or 
effort required to translate a source text into a target text by automatically 
translating hypertext links. 

Another object of the invention is to convert the placeables into a 
language-independent format. 

An object of the invention is to automatically change the appearance 
of placeable elements if appropriate, for example, by converting measurement 
units, date formats, currency values and units, titles and names, etc. 

An object of the invention is to semiautomatically insert placeable 
elements at a user-defined position in the target text upon interaction from 
the user, e.g„, upon one or more keystrokes, upon one or more spoken 
commands, upon isoase clicks, etc^ when translating source information. 

A-n <3q§ect c^i^izreesztson is to automatically insert placeable elements, 
Trttfct&egaB^.qf irfaeBigc m^^rrsl orotfegrTna^^ng-computable information, 
tissi ajjaa r isas ss a dss g to determine the position for the insertion without 
user mtoanserL. 

An object of the invention is that it can be used with manual 
translation or with a translation memory. 



BRIEF DESCRIPTION OF THE DRAWINGS 
FIG. 1 shows an embodiment of the invention; 

FIG. 2 shows another embodiment of the invention; 

FIG. 3 shows another embodiment of the invention; 

FIG. 4 shows another embodiment of the invention; 

FIG. 5 shows another embodiment of the invention. 

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENT 
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The present invention may be carried out by a variety of 
commercially available language translation computer software programs. 
The invention can work with a translation memory, however, it is not a 
requirement. Advantageously, the system will support at least two 
languages. 

First, the system can receive input source translation information, 
such as text data or voice data, that may be entered or retrieved in various 
ways, (e.g., data file, scanned data, voice recording, voice dictation, etc.). 
The program may divide the source translation information into linguistic 
forms or data translation units. This can be accomplished by segmenting 
the source translation information into words, sentences, or paragraphs. 
This process can be system-designated or user-designated. 

Manual Translation 

FIG. 1 shows a flow chart illustrating a process according to the 
invention when a translator is not using a translation memory. Initially, 
input source translation information may be divided into segments, such as 
sentences. Elements of the segments are provided to the processor at 
location (110). The system determines whether an element is considered a 
piaceable (120). ~Wh3e translating the text, the system will advise the 
trp^hrt^issst a. dsrsa gjensent is a piaceable and will allow the translator 
to h2ss tsas-piacsssis saserted into the tar g e t text (130). At this paint, the 
system may also de termin e the type at piaceable in order to assist the 
translator. The type information may be provided to the translator in any 
suitable format, such as by a leading signal, color, font change, etc. In this 
method, the translator may determine where the piaceable should be 
inserted, e.g., upon one or more keystrokes, upon one or more spoken 
commands, upon mouse click(s), etc. If the piaceable requires a conversion 
to the target information, this system may be set to automatically convert 
the piaceable based on type, when the user selects to drop the piaceable 
into the target text (140). This conversion is performed by a target 
piaceable converter according to location information of the target output 
(e.g., target locale, target dictionaries based on target 
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environment/context). The manual translation may be performed in a 
Windows environment. 

Translation Memory 

Translation memories are used for reference purposes when 
5 translating information. FIG. 2 shows a commercially available software 
program for managing reference material for translation when using a 
translation memory. The reference material may be collections of text, 
normally in two or more languages, whereby previously translated source 
text units are associated with target text units. The input designated for 

10 translation will be referred to as source translation information. As 

mentioned above, translation memories are most useful when they are able 
to locate not only identical matches, but also approximate or "fuzzy 
matches." Fuzzy matching facilitates retrieval of text that differs slightly 
from the source translation information. The translation memory can 

15 provide information to the translator that indicates how close the retrieved 
suggested information matches the source translation information. This 
information could be disseminated in the form of a numerical 
representation such as a percentage. In the event that a unit of text (Le^ 
source transtatkai msbrmatiGn) to be translated is identical, or very 

20 similar* to a source text unit occurring in the translation memory and 

which has been a ccurate ly tressiased ax an earlier time, a retrieval system 
can show as a reference the translation stored in the translation memory 
as target text. The translator can then copy this reference unit and modify 
it to fit the new source translation information. In prior systems, if a 

25 placeable, occurring in the text to be translated, was different from a 

corresponding element of the translation memory source text, it would be 
. necessary for a translator to manually transfer and, if necessary, translate 
. or convert placeable data into the target information. 

The user interface of the translation software provides a program 

30 window (200) for displaying different items for the translator. This 

particular example shows the translation software program window (210) 
and a word processor's program window (240) at the same time so that 
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the user can view the entire source translation information, or smaller 
units of interest, during the translation process. Item (240) can also be 
configured to display the final target information, e.g., the translated text, 
during the translation process. Item (210) shows an area (220) where the 
translation program may display the linguistic form or data translation 
unit of the source translation information designated for translation. In 
addition, window (210) may display some suggested translations for the 
linguistic form when using a translation memory tool (230). 

FIG. 3 and FIG. 5 illustrate the invention when a translator is using 
a translation memory. First, the linguistic units of source translation 
information may further be divided into tokens or source elements. Once 
a source element is identified as a placeable (310), its type is determined 
(330) (i.e., date, time, link, etc.), and the placeable may then be converted 
into a language independent format (340), such as a meta-representation, 
or directly converted into a target language or locale. The meta- 
representation (420, 520) allows the system to convert the placeable into 
any target language because the format (meta-representation) is common 
to all locales. The mg fy-repx esentation can be converted according to any 
target locale to produce a l ar^et placeable (360). At this point, the 
placeable may be in serte d into the target language and any conversion may 
be made automatically or semiautomatically. 

Consider the following sentence: "A man, called Mr. Miller, left his 
apartment on the 25 th of January in a car that is capable of driving at 
speeds above 160 mph." A machine translation program would face great 
difficulties in determining whether "A mam called Mr. Miller on the 
phone" or whether "A man" is the same person as "Mr. Miller." This 
question can only be answered by evaluating the context. In other words, 
if one were to look at the word "called" alone, it would be virtually 
impossible to translate the sentence correctly. However, if the entire token 
(410) is considered: "A man, called Mr. Miller," then you (or a machine) 
could come up with a meaningful translation. FIG. 3 shows how the 
invention processes this sentence and how a placeable would be treated the 



-10- 



SUBSTITUTE SHEET (RULE 26) 



WO 99/57651 PCT/EP99/02959 

first time it was identified. Three placeables are identified: Mr. Miller, 
25 th of January, and 160 mph. 



5 



Identified Placeable 


Classification & 
language-independent 
formatting 


Transformation 
to be inserted in the 
target text 


Mr. Miller 


Placeable/Name w/Title 


Herr Miller 


25 th of January 


Placeable/Date-longform 


25. Januar 


160 mph 


Placeable /number w/unit 


260 km/h 



When the system is presented with the foregoing text as part of an 
input of text to be translated (e.g., source translation information), the 
system will first segment the text, preferably into sentences. The 

10 foregoing sentence may then advantageously be tokenized. According to 
one esmodimeni of &e invention, it can be tokenized into elements 
c cs i ^M<%5 to fee words or phrases in the source translation sentence. 
The tokenizing process will consider whether the elements may be 
identified as placeables according to a rule-based query and/or with the use 

15 of finite state tools such as look-up tables. Next, the type of the element 
identified as a placeable will be determined. This determination may also 
be accomplished by a rule-based inquiry and/or with the use of a finite 
state process such as by tables. The output of the tokenizer will include 
the non-placeable elements and an indication of the type of any placeable 

20 elements. This output may be provided to a translation memory in order 
to locate any identical or similar segments that have been previously 
translated. The system may propose a translation based on the previously 
translated target text located in the translation memory and direct 
placement with or without conversion of the placeable elements. The 

25 system may advantageously be implemented by software in a general 
purpose computer such as a personal computer. 
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The determination of a placeable may be a one or two step method 
using a rule-based system/a finite state system. The one-step method may 
also be accomplished by determining the type using a rule-based system 
that, views the entire token. For example: is this token a date, is this 
token a proper noun, is this token a hyperlink? The two step-method may 
first determine whether the token is a placeable and then determine its 
type. This may be accomplished using a finite state process that examines 
each character of a token one at a time until a determination is reached. 

One of the identifying features of a placeable is that its meaning is 
not likely to vary by context. Such placeables may include types that may 
be directly used in a translation, for example, numbers or graphics. Other 
types of placeables may be used after conversion, such as numbers coupled 
with units. For example, a placeable appearing as "62 miles per hour" may 
be converted to "100 kilometers per hour". Such a conversion is 
distinguished from a translation by its formulaic nature. A formulaic 
conversion is suitable in sites±ions where context is not likely to affect the 
translation. 

The foll ow ing ^mr^ ifinstraies the translation of text including a 
similar placeable in another source text. The translation memory system 
includes the following source text unit plus its translation from the 
reference fileCs): 

Translation Memory Source Text Unit: "A man, called Mr. Smith, left his 
apartment on the 1st of April in a car that is capable of driving at speeds 
above 100 mph." 

Translation Memory Target Text Unit (German): "Em Mann, namens 
Herr Smith, verliess sein Apartment am 1. April in einem Auto, das 
schneller als 160 km/h fahren kann." 

New Text Unit to be translated is: W A man, called Mr. Miller, left his 
apartment on the 25 th of January in a car that is capable of driving at 
speeds above 160 mph." 

As shown in FIG. 5., the new source text unit may be divided into 
elements (510). The placeables may then be classified and converted to a 
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language-independent format (520). When the software is able to correlate 
three placeables in the Translation Memory Target Text Unit with 
placeables in both the Translation Memory Source Text Unit and the New 
Text to be translated, a translation memory system using this invention 
can determine and propose the location for inserting the placeables 
automatically, that is, without any user interaction. In the above example, 
the system will be capable of determining that: 

I. The only difference between the Text Unit (to be translated) 
and the Translation Memory Source Text Unit (a similar text 
that has been translated earlier) is found in three tokens. 

II. All three tokens are placeables. 

III. The type {i.e., date, speed, name) of the placeable tokens are 
the same in Translation Memory Source Text Units and the 
text to be translated. 

IV. In the Old Target Text Unit, exactly the same number and 
type of placeable tokens can be found. The translation 
system may propose to reuse the previous translation 
(Translation Memory Target Text Unit) and replace the 
placeable tokens with the new name (=Mr. Miller), the new 
date (—25 th January), and the new speed (=160 mph). 

V. The software may convert certain parts of the placeables 
depending on the type. In this example: "Herr Miller", "25. 
Januar", and "280 km/h" (330). 

Storing Placeables 

There are two types of translation memory systems on the market 
today: reference file driven TMs and database driven TMs. Reference file 
driven TMs keep the source text and target text in two different locations, 
aligning the two by keeping a list of reference pointers to each other. In 
reference file driven systems, all source text units ever written (or 
otherwise created), and all translations thereof, may be physically stored 
and kept in files. In database driven TMs using this invention, there is a 
high potential to conserve data storage space by simply storing the text 
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We claim: 

1. . A method for processing source information comprising the steps of: 

parsing input source information into elements; 
identifying placeable elements by a predetermined criteria; 
designating said placeable elements by type. 

2. A method for processing source information according to claim 1, 
further comprising 'the step of : 

determining a source locale - 

3. A method for processing source information according to claim 2, 
further comprising the step of : 

applying a source placeable identifier to determine said type of said 
element. 

4. A method for processing source information according to claim 1, 
further comprising the step of : 

determining a target locale. 

5. A method for processing source information according to claim 1, 
further comprising the step of : 

applying a target placeable converter to convert said type of 
said element. 
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1 6. A method for processing source information according to claim 2, 

2 further comprising the step of : 

3 .■- applying said source locale to determine said element by type. 

1 7. A method for processing source information according to claim 1, 

2 further comprising the step of : 

3 converting said element into a language-independent format. 

1 8. A method for processing source information according to claim 1, 

2 further comprising the steps of : 

3 determining whether said placeable is a proper noun; 

4 placing said placeable directly into a target output. 

1 9. A method for processing source information according to claim 1, 

2 further comprising the steps of : 

3 determining whether said placeable is a date; 

4 converting said date into a target information according to a 

5 target locale information. 

1 10. A method for processing source information according to claim 1, 

2 further comprising the steps of : 

3 determining whether said placeable is a proper noun; 

4 converting said placeable into a language independent format. 
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1 11. A method for processing source information according to claim 1, 

2 further comprising the steps of : 

3 determining whether said placeable is a proper noun; 

4 converting said placeable into a meta-representation. 

1 12. A method for processing source information according to claim 1, 

2 further comprising the steps of : 

3 determining whether said placeable is a date; 

4 converting said placeable into a language independent format. 

1 13. A method for processing source information according to claim 1, 

2 further comprising the step of : 

3 determining whether said placeable requires conversion. 

1 14. A method for processing source information according to claim 1, 

2 further comprising the step of : 

3 determining whether said placeable is a proper noun. 

1 15. A method for processing source information according to claim 1, 

2 ' further comprising the step of : 

3 determining whether said placeable is a date. 

1 16. A method for processing source information according to claim 1, 

2 further comprising the step of : 

3 determining output requirement for conversions. 
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. 17. A computer driven language processing system for processing source 
information comprising; 
a parser; 

an element identifier, connected to an output of said parser; 

a type designator, connected to an output of said element identifier. 

18. A computer driven language processing system for processing source 
information comprising: 

a parser for parsing source information into elements; 
an element identifier identifying placeable elements by a 
predetermined criteria; 

a type designator for designating said placeable elements by type. 

19. A method for processing source information comprising the steps of: 
segmenting input source information into elements; 

identifying placeable elements by a predetermined criteria; 
designating said placeable elements by type. 

20. A method for processing source information according to claim 8, 
further comprising the step of : 

comparing said placeable elements to a data set. 

21. A method for processing source information according to claim 8, 
further comprising the step of : 

comparing said placeable elements by type to a data set. 
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